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Continuing our earlier work ( quant -ph/0401060 I, we give two alternative proofs 
of the result that a noiseless qubit channel has identification capacity 2: the first 
is direct by a "maximal code with random extension" argument, the second is 
by showing that 1 bit of entanglement (which can be generated by transmitting 1 
qubit) and negligible (quantum) communication has identification capacity 2. This 
generalises a random hashing construction of Ahlswede and Dueck: that 1 shared 
random bit together with negligible communication has identification capacity 1. 
We then apply these results to prove capacity formulas for various quantum feed- 
back channels: passive classical feedback for quantum-classical channels, a feed- 
back model for classical-quantum channels, and "coherent feedback" for general 
channels. 



1 Introduction 

While the theory of identification via noisy channeliPEI has generated sig- 
nificant interest within the information theory community (the areas of, for 
instance, common randomness pi channel resolvability^l and watermarking^! 
were either developed in response or were discovered to have close connections 
to identification), the analogous theory where one uses a quantum channel 
has received comparably little attention: the only works extant at the time 
of writing are Lober's starting of the theoryj21 a strong converse for discrete 
memoryless classical-quantum ch ann els by Ahlswede and Winter,^ and a re- 
cent paper by the present author!^ 

This situation may have arisen from a perception that such a theory would 
not be very different from the classical identification theory, as indeed classical 
message transmission via quantum channels, at a fundamental mathematical 
level, does not deviate much from its classical counterpart i l^^^^ l coding 
theorem and converses are "just like" in Shannon's classical channel coding 
theory, with Holevo information playing the role of Shannon's mutual infor- 
mation. (Though we have to acknowledge that it took quite a while before 
this was understoo d, a nd that there are tantalising differences in detail, e.g. 
additivity problems!^) 

In our recent workp^ however, a quite startling discovery was m ade: it 
was shown that — contrary to the impression the earlier paperi^^!! gave — 
the identification capacity of a (discrete memoryless, as always in this paper) 
quantum channel is in general not equal to its transmission capacity. Indeed, 
the identification capacity of a noiseless qubit was found to be 2. This means 
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that for quantum channels the rule that identification capacity equals common 
randomness capacity (see the discussion by Ahlswede^ and Kleinewachten^j) 
fails dramatically, even for the most ordinary of channels! 

In the present paper we find some new results for identification via quan- 
tum systems: after a review of the necessary definitions and known results 
(section and a collection of statements about what we called "random 
channels" in our ealier paper j^ 1 we first give a direct proof that a qubit has 
identification capacity 2, in section QJ (Our earlier proofs uses a reduction 
to quantum identification, which we avoid here.) Then, in section^ we show 
the quantum analogue of Ahlswede and Dueck's result that 1 bit of shared 
randomness plus negligible communication are sufficient to build an identifica- 
tion code of rate 10 namely, 1 bit of entanglement plus negligible (quantum) 
communication are sufficient to build an identification code of rate 2. In sec- 
tion [B] we briefly discuss the case of more general prior correlations between 
sender and receiver. 

In section we turn our attention to feedback channels: we first study 
quantum-classical channels with passive classical feedback, and prove a quan- 
turn generalisation of the capacity formula of Ahlswede and DueclM Then, in 
section [SJ we introduce a feedback model for general quantum channels which 
we call "coherent feedback" , and prove a capacity formula for these channels 
as well which can be understood as a quantum analogue of the feedback iden- 
tification capacity of Ahlswede and Dueck.EI We also comment on a different 
feedback model for classical-quantum channels. 



2 Review of definitions and known facts 

For a broader review of identification (and, for comparison, transmission) 
via quantum channels we refer the reader to the introductory sections of our 
earlier paperpHlto Lober's Ph. D. th esisP"U and to the classical identification 
papers by Ahlswede and Dueck.EEl Here we are content with repeating the 
bare definitions: 

We are concerned with quantum systems, which are modelled as (finite) 
Hilbert spaces H (or rather the operator algebra B(H)). States on these 
systems we identify with density operators p: positive semidefinite operators 
with trace 1. 

A quantum channel is modelled in this context as a completely postive, 
trace preserving linear map T : B(Hi) — ► B(H.2) between the operator alge- 
bras of Hilbert spaces Hi , H2 ■ 

Definition 1 (L6berj2U Ahlswede and Winter**) An identification 
code for the channel T with error probability Ai of first, and A2 of second 
kind is a set {(pi, Di) : i = 1, . . . , N} of states pi on Hi and operators Di on 
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7^2 with < Di < 1, such that 

Vz Tr(T( Pi )Di) >1-Ai, 

Vt^j Tr(r( Pi )^) < A 2 . 

-For i/ie identity channel id^d of the algebra B(C d ) of a d-dimensional system 
we also speak of an identification code on C d . 

For the special case of memoryless channels T®" (where T is implicitly 
fixed), we speak of an (n, Ai,A2)-ID code, and denote the largest size N of 
such a code N(n, Ai, A2) ■ 

An identification code as above is called simultaneous if all the Di are 
coexistent: this means that there exists a positive operator valued measure 
(POVM) (£ fe )f=i and sets V l <Z {!,..., K] such that D t = Y,k e v % E k- The 
largest size of a simultaneous (n, Ai, \2)-ID code is denoted N s - lnl (n, Ai, A2). 

Most of the current knowledge about these concepts is summarised in the two 
following theorems. 

Theorem 2 (LoberpB Ahlswede and WinteJSl) Con sider any chan- 
nel T, with tr ansmission capacity C(T) (Holevo,^ Schumacher and 
Westmoreland^). Then, the simultaneous identification capacity of T , 

C slm _ ID (T) := inf liminf - loglogiV sim (n, Ai, A 2 ) > C(T). 

Ai,A 2 >0 n^oo n 

(With log and exp in this paper understood to basis 2.) 

For classical-quantum (cq) channels T (see HolevU^), even the strong 
converse for (non-simultaneous) identification holds: 

Cb(T)= lim iloglogiV(n,Ai,A 2 ) = C(T), 

n — >oo 77, 

whenever Ai, A2 > and Ai + A2 < 1. □ 

That the (non-simultaneous) identification capacity can be larger than the 
transmission capacity was shown only recently: 
Theorem 3 (WinterEHl) 

The identification capacity of the noiseless qubit 
channel, ides, is CiD(idc2) = 2, and the strong converse holds. □ 

The main objective of the following three sections is to give two new proofs 
of the achievability of 2 in this theorem. 

3 Random channels and auxiliary results 

The main tool in the following resu lts (as in our earlier papeJ^SI) are random 
channels and in fact random states! ' • 

Definition 4 For positive integers s,t,u with s < tu, the random channel 
Rs is a random variable taking values in quantum channels B(<C S ) — ► 
B(C l ) with the following distribution: 
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There is a random isometry V : C s — ► C (g> C u , by which we mean a 
random variable taking values in isometries whose distribution is left- /right- 
invariant under multiplication by unitaries on C* ® <C u /on C s , respectively, 
such that 

R t M(p) = Tr c «(VpV*). 

Note that the invariance demanded of the distribution of V determines it 
uniquely — one way to generate the distribution is to pick an arbitrary fixed 
isometry Vq : C s — > C ® C u and a random unitary U on C ® C" according 
to the Haar measure, and let V = UVq . 

Remark 5 Identifying C tu with C*<gsC", we have i?' (u) = Tro o#* u(1) . Note 
that is a random isometry from C s into C 4 in the sense of our definition, 

s(l) 

and that the distribution of R s is the Haar measure on the unitary group 
ofC s . 

Remark 6 The one-dimensional Hilbert space C is a trivial system: it has 
only one state, 1, and so the random channel J2* is equivalently described 
by the image state it assigns to 1, 11*1 (1). For s - 1 we shall thus identify 
the random channel R*^ with the random state on C*. A different 

way of describing this state is that there exists a random (Haar distributed) 
unitary U and a pure state -00 such that i?j ' = Tro (UipoU*) — note that 
it has rank bounded by u. These are the objects we concentrate on in the 
following. 

Lemma 7 (see Bennett et a/.pl WinteJ^l) £ei ip be a pure state, P a 
projector of rank (at most) r and let U be a random unitary, distributed ac- 
cording to the Haar measure. Then for e > 0, 

Pr \tt{U^U*P) > (1 + e)^} < cxp (-r * - ^ + ^ 

For < e < 1, and rankP = r, 

Pr | Tr (UipU* P) > (1 + e)^| < exp 

Pr {lr(UipU*P) < (1 - e)^| < exp 
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□ 

Lemma 8 (Bennett et al&) Fore > 0, there exists in the set of pure states 
on C d an e-net M. of cardinality \M\ < (-) ; i.e., 

V</j pure 3ip e M \\ip — (p\\i < e. 

□ 

With these lemmas, we can prove an important auxiliary result: 
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Lemma 9 (see Harrow et al\^ty For < i] < 1 and t < u, consider the 



random state R\ on C* . Then, 



Pr \ R\ {u) £ 



1 — 77 1 + V 



< 2 



lOt 
V 



exp 
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Proof. We begin with the observation that g [al; f3t] if and only if for 

all pure states (rank one projectors) <p, 



Tr(^"V)=Tr(J?^^(^®l M )) 



tu(l) , 



><*, 

<(3- 



Due to the triangle inequality, we have to ensure this only for ip from an 
?7/2t-net and with a = (l - ■§) /t, (3 = (l + §) ft. Then the probability 
bound claimed above follows from lemmas and [SI with the union bound. □ 



4 ID capacity of a qubit 

Here we give a new, direct proof of theorem|31 — in fact, we prove the following 
proposition from which it follows directly. 

Proposition 10 For every < A < \, there exists on the quantum system 
B(C d ) an ID code with 



N = 



2 6XP \ V 3000 log d 



messages, with error probability of first kind equal to and error probability 
of second kind bounded by A. 

Proof. We shall prove even a bit more: that such a code exists which is of the 
form {(pi, Di) : i = 1, . . . , N} with 



Di = supp pi , rank pi 



loge? 



pi < ~^—Di 



(1) 



The constants a < A/4 and 77 < 1/3 will be fixed in the course of this proof. 
Let a maximal code C of this form be given. We shall show that if N is "not 
large", a random codestate as follows will give a larger code, contradicting 
maximality. 

Let R — Rf"^ (the random state in dimension d with 5-dimensional 
ancillary system, see definition 0} , and D := suppi?. Then, according to the 
Schmidt decomposition and lemma El 



Pr<^7?^ 



1 — 77 1 + 71 



Pr I R{ {d) $ 



1 — 77 1+77 



< 2 



IOA 



2 ( ) 



exp — d 
77 j 1 \ 24 In 2 



(2) 
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This is < 1/2 if 



d > — J— log— SlogS, 



which we ensure by choosing a < A ^ 96 t |" 2 log < A/4. 

In the event that —p-D < R < ^-p-D, we have on the one hand 

Ty( Pi D) < Tr n^Di^—Rj < 2Tr{RD l ). (3) 

On the other hand, because of R,f^ = Tv£6Rf S<,1 \ we can rewrite Tr(RDi) — 
Tr(i?f <g> l s )), hence by lemmaQ 

Pr{Tr(i?A) > A/2} < cxp(-S 2 ). (4) 
So, by the union bound, eqs. © and (@J yield 

Py\C(J{(R,D)} has error probability of 

second kind larger than A or violates eq. Q j < — + iVexp(— <5 2 ). 

If this is less than 1, there must exist a pair (R, D) extending our code while 
preserving the error probabilities and the properties of eq. which would 
contradict maximality. Hence, 

iV> iexp(^), 

and we are done, fixing rj — 1/3 and a — A/3000. □ 

The proof of theorem\^is now obtained by applying the above proposition to 
d — 2", the Hilbert space dimension of n qubits, and arbitarily small A. That 
the capacity is not more than 2 is by a simple dimension counting argument}^ 
which we don't repeat here. □ 



5 ID capacity of an ebit 

Ahlswede and Duecl0 have shown that the identification capacity of any sys- 
tem, as soon as it allows — even negligible — communication, is at least as 
large as its common randomness capacity: the maximum rate at which shared 
randomness can be generated. (We may add, that except for pathological ex- 
amples expressly constructed for that purpose, in practically all clas sical sys - 
tems for which these two capacities exist, they turn out to be equal vH^ |) 
Their proof relies on a rather general construction, which we restate here, 
in a simplified version: 
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Proposition 11 (Ahlswede and Dueck ) There exist, for X > and N > 

4 1 / A , functions : {1, . . . , M} — ► {1, . . . , N} (i = 1, . . . , 2 M ) such that the 
distributions Pi on {1, ... , M} x {1, . . . , N} defined by 

1 otherwise. 

and the sets T>i — supp Pj /orm an identification code with error probability of 
first kind and error probability of second kind X. 

In other words, prior shared randomness in the form of uniformly dis- 
tributed (i £ {1, ...,M} between sender and receiver, and transmission of 
v 6 {1, . . . , N} allow identification of 2 M messages. □ 
(In the above form it follows from proposition 1151 below: a perfect transmis- 
sion code is at the same time always an identification code with both error 
probabilities 0.) 

Thus, an alternative way to prove that a channel of capacity C allows 
identification at rate > C, is given by the following scheme: use the channel 
n — 0(1) times to generate Cn — o(n) shared random bits and the remaining 
O(l) times to transmit one out of N — 2°^ messages; then apply the above 
construction with M — 2 Cn ~ ot - n \ More generally, a rate R of common ran- 
domness and only negligible communication give identification codes of rate 
R. 

The quantum analogue of perfect correlation (i.e., shared randomness) 
being pure entanglement, substituting quantum state transmission wherever 
classical information was conveyed, and in the light of the result that a qubit 
has identification capacity 2, the following question appears rather natural 
(and we have indeed raised it, in remark 14 of our earlier papeJ^SI): Does 
1 bit of entanglement plus the ability to (even only negligibly) communicate 
result in an ID code of rate 2, asymptotically? 

Proposition 12 For X > 0, d > 2 and A > (^9? log ^M) log d, there exist 
quantum channels T t : B(C d ) — ► B(C A ) (i = 1, . . . , N' = [| exp(d 2 )~| j, such 
that the states pi — (id(8Tj)$rf (with state vector \<$>d) — 77^ Ey=i )> an< ^ 
the operators Di = supp^i form an identification code on B(C d ® C A ) with 
error probability of first kind and error probability of second kind X. 

In other words, sender and receiver, initially sharing the maximally en- 
tangled state can use transmission of a A- dimensional system to build an 
identification code with |~^exp(d 2 )] messages. 

Proof. Let a maximal code C as described in the proposition be given, such 
that additionally 

A=supp/3j, rsaikpi=d, pi < + A- (5) 

Consider the random state R — R dA ^ on C dA = C d <g>C A , and D := suppi?. 
Now, by Schmidt decomposition and with lemma (compare the proof of 
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proposition I1U|) . for rj := A/3 
Prii? ^ 



l — n 1 + 7? 

— t^-D: ——!-D 



The very same estimate gives 



VrlRf Ad) 



1-7? 1 



< 2 



lOd 



2d 



exp 



-dA 



24 In 2 



(6) 



1„ 



= Pr<ji?f Ad) 1 



1 — 7? 1+7? 

— r L U\— r L U 



< 2 



lOrf 
V 



2d 



exp 



-dA 



V 
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(7) 



By choosing A > ^ 144 v \ n 2 log logd, as we indeed did, the sum of these two 

probabilities is at most 1/2. 

In the event that ^p-D < R < -jjr^-D, we argue similar to the proof of 
proposition I1UI fcompare eq. ©): 



Tr(p,D) < Tr 



1 + A d 

A- R < 3Tr(i?A) 



(8) 
(9) 



d " 1 — rj 

On the other hand (compare eq. |@J|), 

Pr{Tr(i?A) > A/3} < exp(-d 2 ), 

by lemma[7]and using A -1 < A/6. 

In the event that -'-r 2 ! < Tr C Ai? < ^g 2 !, there exists an operator X on 
C d with tt-1< AC < JLl, such that 

i?o := VR(X ® (which has the same support D as i?) 

satisfies Tr C Ai?o = gl- By the Jamiolkowski isomorphism^! between quan- 
tum channels and states with maximally mixed reduction, this is equivalent to 
the existence of a quantum channel To such that Rq = (id ® To)$rf. Observe 
that i? < ±±^L> and Tr(i? A) < §Tr(i?A)- 

So, putting together the bounds of eqs. ©id)© an d ©> we g et , by 
the union bound, 

Prjcu {(Ro,D)} has error probability of 

second kind larger than A or violates eq. © j < — + N' exp(— g? 2 ) . 

If this is less than 1, there will exist a state Ro = (id®To)$d and an operator 
D enlarging the code and preserving the error probabilities as well as the 
properties in eq. ©, which contradicts maximality. 

Hence, N' > | exp(<i 2 ), and we are done. □ 

This readily proves, answering the above question affirmatively: 



Theorem 13 The identification capacity of a system in which entanglement 
(EPR pairs) between sender and receiver is available at rate E, and which 
allows (even only negligible) communication, is at least 2E. This is tight for 
the case that the available resources are only the entanglement and negligible 
communication. □ 
Remark 14 Just as the Ahlswede-Dueck construction of proposition II II can 
be understood as an application of random hashing, we are tempted to present 
our above construction as a kind of "quantum hashing" : indeed, the (small) 
quantum system transmitted contains, when held together with the other 
half of the prior shared entanglement, just enough of a signature of the func- 
tions/quantum channels used to distinguish them pairwise reliably. 



6 General prior correlation 

Proposition 1111 quantifies the identification capacity of shared randomness, 
and proposition 1121 does the same for shared (pure) entanglement. This of 
course raises the questions what the identification capacity of other, more 
general, correlations is: i.e., we are asking for code constructions and bounds 
if (negligible) quantum communication and n copies of a bipartite state u> 
between sender and receiver are available. 

For the special case that the correlation decomposes cleanly into entan- 
glement and shared randomness, 

w = ® \^M A ' ® \»M B \ 

with an arbitrary perfect classical correlation (between registers A' and B') 
distributed according to p and arbitrary pure entangled states we can 
easily give the answer (let the sender be in possession of AA' , the receiver of 
BB'): 

C m = H(p) + 2j2p^(^ B ); (10) 

here, H(p) is the entropy of the classical perfect correlation p; E{^> AB ) = 
S{^ A ) is the entropy of entanglement,^ with the reduced state ^ A = 
Tr B ^ AB . The achievability is seen as follows: by entanglement and random- 
ness concentration^ this state yields shared randomness and entanglement at 
rates R = H(p) and E — ^ p^E^^), respectively (without the need of 
communication — note that both users learn which entangled state they have 
by looking at the primed registers). Proposition ^] yields an identification 
code of rate 2E, while proposition 1151 below shows how to increase this rate 
by R. 

That the expression is an upper bound is then easy to see, along the 
lines of the argume nts giv en in our earlier paper for the capacity of a "hybrid 
quantum memory" . 
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Proposition 15 (WinterEHl) Let {(pi, Di) : i = 1. . . . , N} be an identifica- 
tion code on the quantum system TL with error probabilities Ai, A2 of first and 
second kind, respectively, and let Tic be a classical system of dimension M (by 
this we mean a Hibert space only allowed to be in a state from a distinguished 
orthonormal basis {|m)}«=i^- Then, for every e > 0, there exists an identi- 
fication code {(o~f,Df) : f = 1, . . . , TV'} on Tic <8> H with error probabilities 

Ai, A2 + e of first and second kind, respectively, and N' > (^N e ) AI . The f 
actually label functions (also denoted f) {1, . . . , M} — > {1, . . . , N}, such that 

a f = ®P/(fc). 

In other words, availability of shared randomness (fj, on the classical sys- 
tem Tic ) with an identification code allows us to construct a larger identifica- 
tion code. □ 

The general case seems to be much more complex, and we cannot offer an 
approximation to the solution here. So, we restrict ourselves to highlighting 
two questions for further investigation: 

1. What is the identification capacity of a bipartite state w, together with 
negligible communication? For noisy correlations, this may not be the 
right question altogether, as a look at work by Ahlswede and BalakirskjGI 
shows: they have studied this problem for classical binary correlations 
with symmetric noise, and have found that — as in common randomness 
theory^ — one ought to include a limited rate of communication and 
study the relation between this additional rate and the obtained identi- 
fication rate. Hence, we should ask: what is the identification capacity 
of lj plus a rate of C bits of communication? An obvious thing to do in 
this scenario would be to use part of this rate to do entangle ment dis- 
tillation of which the communication cost is known in principle!^ This 
gives entanglement as well as shared randomness, so one can use the con- 
structions above. It is not clear of course whether this is asymptotically 
optimal. 

2. In the light of the code enlargement proposition ^1 it would be most in- 
teresting to know if a stronger version of our proposition II 2^ theorem ITS1 
holds: Does entanglement of rate E increase the rate of a given identifi- 
cation code by 2E1 

7 Identification in the presence of feedback: 
quantum classical channels 

Feedback for quantum channels is a somewhat problematic issue, mainly be- 
cause the output of the channel is a quantum state, of which there is in general 
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no physically consistent way of giving a copy to the sender. In addition, it 
should not even be a "copy" for the general case that the channel outputs 
a mixed state (which corresponds to the distribution of the output), but a 
copy of the exact symbol the receiver obtained; so the feedback should estab- 
lish correlation between sender and receiver, and in the quantum case this 
appears to involve further choices, e.g. of basis. The approach taken in the 
small literat ure o n the issue of feedback in quantu m ch annels (see Fujiwara 
and Nagaokapl BowenP and Bowen and NagarajarP^) has largely been to 
look at active feedback, where the receiver decides what to give back to the 
sender, based on a partial evaluation of the received data. 

We will begin our study by looking at a subclass of channels which do not 
lead into any of these conceptual problems: quantum-classical (qc) channels, 
i.e., destructive measurements, have a completely classical output anyway, so 
there is no problem in augmenting every use of the channel by instantaneous 
passive feedback. 

Let a measurement POVM (M y ) y ^y be given; then its qc-channel is the 
map 

Trp^^TrCpM^lyXyl, 
v 

with an orthogonal basis (\y)) y of an appropriate Hilbert space J- ', say. We 
will denote this qc-channel as T : B(Tt) — ► y. 

For a qc-channels T, a (randomised) feedback strategy F for block n is 
given by states p t:J ,t-i on Hi for each t — 1, . . . , n and y t ~ 1 E y t : this is the 
state input to the channel in the t th timestep if the feedback from the previous 
rounds was y 4-1 — y\ . . . yt—i- Clearly, this defines an output distribution Q 
on y n by iteration of the feedback loop: 

n 

Q{y n ) = \[^{Pt:y^M yt ). (11) 

t-1 

Remark 16 We could imagine a more general protocol for the sender: an 
initial state oo could be prepared on an ancillary system H.a, and the feedback 
strategy is a collection $ of completely positive, trace preserving maps 

tpt : B(JT®(*- 1 ) ® H A ) ~^B(H a ®H), 

where T is the quantum system representing the classical feedback by states 
from an orthogonal basis: this map creates the next channel input and a new 
state of the ancilla (potentially entangled) from the old ancilla state and the 
feedback. 

This more general scheme allows for memory and even quantum correla- 
tions between successive uses of the channel, via the system TLa- However, 
the scheme has, for each "feedback history" y l ~ l up to time t, a certain state 
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Ut-i-.yt- 1 on H-A (starting with <jq), and consequently an input state Pt-. y t-i 
on Tt: 



Pt-.yt 



Vt-.yt = 




_1 | ® o-t-i :3 /'-i))j 



It is easy to check that the corresponding output distribution Q of this feed- 
back strategy according to our definition (see eq. Ullj l) is the same as for the 
original, more general feedback scheme. So, we do not need to consider those 
to obtain ultimate generality 

An (n, Ai, X2) -feedback ID code for the qc-channel T with passive feed- 
back is now a set {(Fj, Di) : i = 1, . . . , N} of feedback strategies Fi and of 
operators < Di < 1, such that the output states uji — Yl y n Qi{y n )\y n ){y n \ 
with the operators Di form an identification code with error probabilities Ai 
and A2 of first and second kind. Note that because the output is classical — 
i.e., the states are diagonal in the basis (\y n )) — , we may without loss of gener- 
ality assume that all D l = £ y „ Di{y n )\y n )(y n \, with certain < Di(y n ) < 1. 

Finally, let iVi?(n, Ai, A2) be the maximal N such that there exists an 
(n, Ai, A2)-feedback ID code with ./V messages. Note that due to the classical 
nature of the channel output codes are automatically simultaneous. 

To determine the capacity, we invoke the following result: 
Lemma 17 (Ahlswede and Dueck,^ Lemma 4) Consider a qc-channel 
T : B{TL) — > y and any randomised feedback strategy F for block n. Then, for 
e > 0, there exists a set E C y n of probability Q{£) > 1 — e and cardinality 



The proof of Ahlswede and DueclP' applies directly: a qc-channel with feed- 
back is isomorphic to a classical feedback channel with an infinite input al- 
phabet (the set of all states), but with finite output alphabet, which is the 



This is the essential tool to prove the following generalisation of Ahlswede's 
and Dueck's capacity resultpl 

Theorem 18 For a qc-channel T and Ai, A2 > 0, Ai + A2 < 1, 



unless the transmission capacity of T is 0, in which case C^,(T) = 0. 

In other words, the capacity of a nontrivial qc-channel with feedback is 
its maximum output entropy and the strong converse holds. 




where a = \y\e x / 2 . 



relevant fact. 



□ 



km -loglogiV>(7i,Ai, A 2 ) = CmCT) = maxH(T{p)), 
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Proof. Let's first get the exceptional case out of the way: C(T) can only be for 
a constant channel (i.e., one mapping every input to the same output). Clearly 
such a channel allows not only no transmission but also no identification. 

The achievability is explained in the paper of Ahlswede and DueckP^ the 
sender uses m = n — 0(1) instances of the channel with the state p each, 
which maximises the output entropy. Due to feedback they then share the 
outcomes of m i.i.d. random experiments, which they can concentrate into 
nH(T(p)) — o(n) uniformly distributed bits. (This is a bit simpler than in the 
original paper they just cut up the space into type classes.) The remaining 
O(l) uses of the channel (with an appropriate error correcting code) are then 
used to implement the identification code of proposition 1111 based on the 
uniform shared randomness. 

The strong converse is only a slight modification of the arguments of 
Ahlswede and DueckJ^ due to the fact that we allow probabilistic decoding 
procedures: first, for each message i in a given code, lemma IT7I gives us a 
set Si C y n of cardinality < K = exp (nmax p H(T(p)) + 3|y|e _1/2 V^) , with 
probability 1 — e/3 under the feedback strategy F i: where e := 1 — Ai — A 2 > 0. 
Now let c := |~|], and define new decoding rules by letting 



Di(y n ) :-- 



\\cDi{y n )\ tor y n e S h 
for y n Si. 



(I.e., round the density Di(y n ) down to the nearest multiple of 1/c within Si, 
and to without.) It is straightforward to check that in this way we obtain 
an (n, Ai + |e, A2) -feedback ID code. 

The argument is concluded by observing that the new decoding densities 
are (i) all distinct (otherwise Ai + |e + A2 > 1), and (ii) all have support 
<K = exp(nmax p i/(T( j o)) + 3|y|e~ 1 / 2 V^). Hence 

r n 2 nmax p ff (T(p)) + 0(v^) 



n<^ k )(c + ir < [(c+i)\y\ 

from which the claim follows. □ 



8 Identification in the presence of feedback: 
"coherent feedback channels" 

Inspired by the work of Harrow^^ we propose the following definition of "co- 
herent feedback" as a substitute for full passive feedback: by Stinespring's 
theorem we can view the channel T as an isometry U : Hi — > H2 <8> H-3, fol- 
lowed by the partial trace Tr3 over H.3 : T(p) = Tr3 (UpU*) . Coherent feedback 
is now defined as distributing, on input p, the bipartite state Q(p) '■— UpU* 
among sender and receiver, who get H.3 and H.2, respectively. 
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A coherent feedback strategy $ for block n consists of a system Ha, ini- 
tially in state o~o, and quantum channels 

-.B{n A ® nf -^b{h a ® nf (t - 1} ® Hi) , 

creating the i th round channel input from the memory in Ha and the previ- 
ous coherent feedback Tif^ t ~ 1 \ The output state on Hf n after n rounds of 
coherent feedback channel alternating with the tpt, is 



W - Tl H A ®Hf 



(0 o ip n o o ip„_i o • • • o 6 o (/?i)ctq 



where implicitly each is patched up by an identity on all systems different 
from Hi, and each ip t is patched up by an identity on H^ *\ 

Now, an (n, Ai, X2) -coherent feedback ID code for the channel T with 
coherent feedback consists of N pairs Z?j) of coherent feedback strategies 
$i (with output states u>i) and operators < Di < 1 on W®", such that the 
(u)i, A) form an (n, Ai, A 2 )-ID code on Hf n . 

As usual, we introduce the maximum size N of an (n, A 1, A2) -coherent 
feedback ID code, and denote it JVij? \ (n, Ai, A2). It is important to under- 
stand the difference to Np(n, Ai, A2) at this point: for the qc-channel, the 
latter refers to codes making use of the classical feedback of the measurement 
result, but coherent feedback — even for qc-channcls — creates entanglement 
between sender and receiver, which, as we have seen in section |5j allows for 
larger identification codes. 

We begin by proving the analogue of lemma IT71 
Lemma 19 Consider a quantum channel T : B{TLi) — » B{H.2) and any feed- 
back strategy $ on block n with output state lo on H.f n . Then, for e > 0, there 
exists a projector II on Tlf n with probability Tr(u>II) > 1 — e and rank 



rankll < exp yn max S(T(p)) + ct\pn 
where a = (dim7i2)e 

Proof. The feedback strategy determines the output state uj on Tif n , and we 
choose complete von Neumann measurements on each of the n tensor fac- 
tors: namely, the measurement M of an eigenbasis {\m y }) y of u, the entropy- 
maximising output state of T (which is unique, as easily follows from the strict 
concavity of S). 

Defining the qc-channel T := M o T (i.e., the channel T followed by 
the measurement M), we are in the situation of lemma 1171 with y = 
{1, . . . , dim 7^2}- Indeed, we can transform the given quantum feedback strat- 
egy into one based solely on the classical feedback of the measurement results, 
as explained in remark 1161 Note that the additional quantum information 
available now at the sender due to the coherent feedback does not impair the 
validity of the argument of that remark: the important thing is that the clas- 
sical feedback of the measurement results collapses the sender's state into one 
depending only on the message and the feedback. 
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By lemma [201 stated below, max p H(T(p)) = S(ui), so lemma [T7| gives 
us a set £ of probability Q(£) > 1 — e and \£\ < exp(nS(ui) + ay^nj. The 
operator 

n : = \ m yi)( m vi\® l m yJ( m yJ 

then clearly satisfies Tr(wll) = Q{£ ) > 1 — e, and rank II = |£| is bounded as 
in lemma IT7I □ 

Lemma 20 Let T : B(C dl ) — ► B(C d2 ) be a quantum channel and let p 
maximise S(T(p)) among all input states p. Denote w — T(p) (which is easily 
seen to be the unique entropy-maximising output state ofT), and choose a 
diagonalisation uj = y\ Xj\ej)(ej\. Then, for the channel T defined by 

f{p)=Y / \ej){e j \T{p)\e j ){e j \ 

3 

(i.e., T followed by dephasing of the eigenbasis of to), 

maxS(f(p)) = 5(5) = maxS , (T(p)). 

Proof. The inequality ">" is trivial because for input state p, T and T have 
the same output state. 

For the opposite inequality, let us first deal with the case that ui is strictly 
positive (i.e., is not an eigenvalue). The lemma is trivial if u) = gjl, so we 
assume uj ^ ^1 from now on. Observe that M := {T(p) : p state on C dl } is 
convex, as is the set S := {t state on C d2 : S(t) > S(uj)}, and that jVnS = 
{to}. Since we assume that ui is not maximally mixed, S is full-dimensional 
in the set of states, so the boundary dS = {r : S(t) — S(ui)} is a one- 
codimensional submanifold; from positivity of ui (ensuring the existence of 
the derivative of S) it has a (unique) tangent plane H at this point: 

H = j£ state on C d2 : Tr[(£ - w)VS , (w)] = oj. 

Thus, H is the unique hyperplane separating S from Af: 

S C H+ = j£ state on C d2 : Tr[(£ - w)VS'(w)] > oj, 

TV C H~ = |^ state on C da : Tr[(^ - 5)VS , (a3)] < oj. 

Now consider, for phase angles a — (ai, . . . , Oid a ), the unitary U a = 
J2j e * Qj \ e j)( e j\i which clearly stabilises S and leaves ui invariant. Hence, also 
H and the two halfspaces H + and H~ are stabilised: 

U a HU* = H, U a H+U* = H+, U a H-U* = H~. 
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In particular, U a j\fU* C H , implying the same for the convex hull of all 
these sets: 

conv | (J UaNK \cH-, 
Since this convex hull includes (for r e AT) the states 

3 

we conclude that for all p, f(p) E H~ , forcing S(f(p)) < 5(2). 

We are left with the case of a degenerate u: there we consider per- 
turbations T e = (1 — e)T + of the channel, whose output entropy is 
maximised by the same input states as T, and the optimal output state is 
ui e = (1 — e)u> + eJjH- These are diagonal in any diagonalising basis for w, so 

f e = (l-c)f + e£l. 

Now our previous argument applies, and we get for all /?, 

S(f e (p)) <5(2 £ ) < (l-e)5(5) + elogd 2 +if(e,l- e ). 

On the other hand, by concavity, 

S(f t (p)) > (l-e)S(f(p))+e\ogd 2 . 

Together, these yield for all p, 

S(T(p))<S(Z) + T !— e H(e,l-e), 

and letting e — » concludes the proof. □ 

We are now in a position to prove 

Theorem 21 For a quantum channel T and Ai, A2 > 0, Ai + A2 < 1, 
lira -loglogiV| F> (n,Ai,A 2 ) = C$(T) = 2max5(T( (0 )), 

unless the transmission capacity of T is 0, in which case Cj^(T) = 0. 

In other words, the capacity of a nontrivial quantum channel with coherent 
feedback is twice its maximum output entropy and the strong converse holds. 
Proof. The trivial channel is easiest, and the argument is just as in theorem ll8l 
Note just one thing: a nontrivial channel with maximal quantum feedback 
will always allow entanglement generation (either because of the feedback or 
because it is noiseless) , so — by tclcportation — it will always allow quantum 
state transmission. 

For achievability, the sender uses m = n — 0(\ogn) instances of the chan- 
nel to send one half of a purification ty p of the output entropy maximising 
state p each. This creates m copies of a pure state which has reduced state 
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T(p) at the receiver. After performing entanglement concentration^ which 
yields nS(T(p)) — o(n) EPR pairs, the remaining O(logn) instances of the 
channel are used (with an appropriate error correcting code and taking some 
of the entanglement for teleportation) to implement the construction of propo- 
sition^] based on the maximal entanglement. 

The converse is proved a bit differently than in theorem 1181 where we 
counted the discretised decoders: now we have operators, and discretisation 
in Hilbert space is governed by slightly different rules. Instead, we do the 
following: given an identification code with feedback, form the uniform prob- 
abilistic mixture <& of the feedback strategies <&i of messages i — formally, <f> = 
■h $>i. Its output state u> clearly is the uniform mixture of the output states 
uji corresponding to message i: uj — i cjj. With e = 1 — Ai — A2, lemma ITHl 
gives us a projector IT of rank K < exp(nmax p S(T(p)) + 48(dimH 2 ) 2 e-\/") 
such that Tr(wII) > 1 — ^(e/24) 2 . Thus, for half of the messages (which we 
may assume to be i = 1, .. ., [N/2\), Tr(w,n) > 1 - (e/24) 2 . 

Observe that the a;,; together with the decoding operators Di form an 
identification code on B(Ttf n ), with error probabilities of first and second 
kind Ai and A2, respectively. Now restrict all and Di (i < N/2) to the 
supporting subspace of II (which we identify with C^): 

Tr(wilT) 

This is now an identification code on B(C K ), with error probabilities of first 
and second kind bounded by Ai + |e and A2 + ^e, respectively, as a consequence 

of the gentle measurement lemma P^l namely, ^\\u>i — u>i\[i < he. So finally, 
we can invoke Proposition 11 of our earlier paper which bounds the size 
of identification codes (this, by the way, is now the discretisation part of the 
argument): 

N ( 5 \ /15 N 



< 



2 - Vl-Ai -e/3-A 2 -e/3 / 
and we have the converse. □ 

Remark 22 For cq-channels T : X — ► B(H) (a map assigning a state 
T(x) = p x to every element x from the finite set X), we can even study 
yet another kind of feedback (let us call it cq- feedback): fix purifications ty x 
of the p x , on TC ®TL\ then input of x G X to the channel leads to distribution of 
ty x between sender and receiver. In this way, the receiver still has the channel 
output state p x , but is now entangled with the sender. 

By the methods employed above we can easily see that in this model, the 
identification capacity is 

C[ D F (T) > max js (E P (^) • 
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Achievability is seen as follows: for a given P use a transmission code of 
rate I(P;T) = P( x )Px) ~ J2 X P{ x )S{Px) and with letter frequencies 

P in the codewords! ' ' This is used to create shared randomness of the 
same rate, and the cq feedback to obtain pure entangled states which are 
concentrated into EPR pairs Z at rate J2 X P{ x )E{^x) = J2x P( x )S(Px)'- then 
we use eq. lfTU|) . 

The (strong) converse seems to be provable by combining the approxi- 
mation of output statistics result of Ahlswede and WinteJ^I with a dimension 
counting argument as in our previous paper's^^l Proposition 11, but we won't 
follow on this question here. 

Remark 23 Remarkably, the coherent feedback identification capacity 
C m (T) of a channel is at present the only one we actually "know" in the 
sense that we have a universally valid formula which can be evaluated (it is 
single-letter); this is in marked contrast to what we can say about the plain 
(non-simultaneous) identification capacity, whose determination remains the 
greatest challenge of the theory. 
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